Investigating the role of the Lombard reflex in visual and audiovisual speech recognition

Authors

  • Panikos Heracleous
  • Miki Sato
  • Carlos Toshinori Ishi
  • Norihiro Hagita
Abstract

This study focuses on the analysis of the Lombard effect in visual and audiovisual speech recognition. Previous studies have shown that the performance of an audio-only automatic speech recognizer decreases in noisy environments because of the Lombard reflex. A few studies have considered the visual changes due to the Lombard reflex, but the role of the Lombard reflex in automatic visual speech recognition has not been investigated so far. The authors show that the Lombard reflex plays an important role not only in audio, but also in automatic visual speech recognition, and this factor should be considered while designing a robust audiovisual speech recognizer.

Similar resources

Audiovisual processing of Lombard speech

Perception results are presented that address the role of Lombard speech in auditory and audiovisual speech perception. Basically, visual enhancement neutralizes the advantage of Lombard speech observed for auditory perception. It remains an open question whether or not Lombard speech is preferable for perception studies of speech in noise.

Investigating the role of the Lombard reflex in non-audible murmur (NAM) recognition

In this paper, we report non-audible murmur (NAM) recognition results in noisy environments and investigate the effect of the Lombard reflex on non-audible murmur recognition. Non-audible murmur is speech uttered very quietly and captured through body tissue by a special acoustic sensor (e.g., a NAM microphone). A system based on non-audible murmur recognition can be applied in cases when privacy ...

Audiovisual Lombard speech: reconciling production and perception

An earlier study compared audiovisual perception of speech 'produced in environmental noise' (Lombard speech) and speech 'produced in quiet' with the same environmental noise added. The results showed that listeners make differential use of the visual information depending on the recording condition, but gave no indication of how or why this might be so. A possible confound in that study wa...

Visual information and redundancy conveyed by internal articulator dynamics in synthetic audiovisual speech

This paper reports results of a study investigating the visual information conveyed by the dynamics of internal articulators. Intelligibility of synthetic audiovisual speech with and without visualization of the internal articulator movements was compared. Additionally, speech recognition scores were contrasted before and after a short learning lesson in which articulator trajectories were expla...

Audiovisual Segregation in Cochlear Implant Users

It has traditionally been assumed that cochlear implant users de facto perform atypically in audiovisual tasks. However, a recent study that combined an auditory task with visual distractors suggests that only those cochlear implant users who are not proficient at recognizing speech sounds might show abnormal audiovisual interactions. The present study aims at reinforcing this notion by invest...

Publication date: 2010